Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 12621960 |
| Missing cells | 15275736 |
| Missing cells (%) | 6.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 GiB |
| Average record size in memory | 145.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 8 |
| Boolean | 2 |
Category is highly overall correlated with Exclude and 2 other fields | High correlation |
Exclude is highly overall correlated with Category and 3 other fields | High correlation |
Motif is highly overall correlated with Moving_Quickly and 3 other fields | High correlation |
Moving_Quickly is highly overall correlated with Exclude and 3 other fields | High correlation |
Predominant_Behavior is highly overall correlated with Category and 4 other fields | High correlation |
Secondary_Descriptor is highly overall correlated with Category and 4 other fields | High correlation |
centroid_x is highly overall correlated with rat | High correlation |
centroid_y is highly overall correlated with rat | High correlation |
distance is highly overall correlated with speed | High correlation |
group is highly overall correlated with rat_id | High correlation |
motif_hmm_650 is highly overall correlated with Motif | High correlation |
rat is highly overall correlated with centroid_x and 2 other fields | High correlation |
rat_id is highly overall correlated with group and 1 other fields | High correlation |
speed is highly overall correlated with distance | High correlation |
in_center is highly imbalanced (66.6%) | Imbalance |
Exclude is highly imbalanced (53.1%) | Imbalance |
Moving_Quickly has 5166976 (40.9%) missing values | Missing |
Predominant_Behavior has 909134 (7.2%) missing values | Missing |
Secondary_Descriptor has 8290492 (65.7%) missing values | Missing |
Category has 909134 (7.2%) missing values | Missing |
rat is uniformly distributed | Uniform |
motif_hmm_650 has 415439 (3.3%) zeros | Zeros |
Motif has 415439 (3.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-01-30 01:50:21.784797 |
|---|---|
| Analysis finished | 2024-01-30 02:17:10.082859 |
| Duration | 26 minutes and 48.3 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
file_name
Text
| Distinct | 468 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 57 |
| Mean length | 57.692308 |
| Min length | 57 |
Characters and Unicode
| Total characters | 728190000 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 |
|---|---|
| 2nd row | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 |
| 3rd row | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 |
| 4th row | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 |
| 5th row | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 |
| Value | Count | Frequency (%) |
| 22-01-10_baseline_1_djl_tabb_cropped_crf0_0min_to_15min_rat1 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tgbh_cropped_crf0_0min_to_15min_rat3 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tabb_cropped_crf0_0min_to_15min_rat3 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tabb_cropped_crf0_0min_to_15min_rat4 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tcbd_cropped_crf0_0min_to_15min_rat1 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tcbd_cropped_crf0_0min_to_15min_rat2 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tcbd_cropped_crf0_0min_to_15min_rat3 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tcbd_cropped_crf0_0min_to_15min_rat4 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tebf_cropped_crf0_0min_to_15min_rat1 | 26970 | 0.2% |
| 22-01-10_baseline_1_djl_tebf_cropped_crf0_0min_to_15min_rat2 | 26970 | 0.2% |
| Other values (458) | 12352260 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 126219600 | |
| 0 | 48546000 | 6.7% |
| 2 | 37137690 | 5.1% |
| e | 35924040 | 4.9% |
| 1 | 33254010 | 4.6% |
| i | 27833040 | 3.8% |
| n | 27833040 | 3.8% |
| t | 26214840 | 3.6% |
| R | 26214840 | 3.6% |
| o | 25243920 | 3.5% |
| Other values (41) | 313768980 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 264737520 | |
| Decimal Number | 159554520 | |
| Uppercase Letter | 152434440 | |
| Connector Punctuation | 126219600 | |
| Dash Punctuation | 25243920 | 3.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 26214840 | |
| B | 16182000 | |
| T | 14671680 | |
| D | 14563800 | |
| F | 13700760 | |
| J | 13592880 | |
| L | 13592880 | |
| C | 13592880 | |
| W | 10140720 | 6.7% |
| K | 1186680 | 0.8% |
| Other values (14) | 14995320 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 35924040 | |
| i | 27833040 | |
| n | 27833040 | |
| t | 26214840 | |
| o | 25243920 | |
| p | 25243920 | |
| m | 25243920 | |
| a | 15211080 | |
| r | 14563800 | |
| c | 12621960 | 4.8% |
| Other values (6) | 28803960 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 48546000 | |
| 2 | 37137690 | |
| 1 | 33254010 | |
| 5 | 17152920 | 10.8% |
| 4 | 8981010 | 5.6% |
| 3 | 8010090 | 5.0% |
| 6 | 3236400 | 2.0% |
| 8 | 2589120 | 1.6% |
| 9 | 647280 | 0.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 126219600 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25243920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 417171960 | |
| Common | 311018040 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 35924040 | 8.6% |
| i | 27833040 | 6.7% |
| n | 27833040 | 6.7% |
| t | 26214840 | 6.3% |
| R | 26214840 | 6.3% |
| o | 25243920 | 6.1% |
| p | 25243920 | 6.1% |
| m | 25243920 | 6.1% |
| B | 16182000 | 3.9% |
| a | 15211080 | 3.6% |
| Other values (30) | 166027320 |
Common
| Value | Count | Frequency (%) |
| _ | 126219600 | |
| 0 | 48546000 | 15.6% |
| 2 | 37137690 | 11.9% |
| 1 | 33254010 | 10.7% |
| - | 25243920 | 8.1% |
| 5 | 17152920 | 5.5% |
| 4 | 8981010 | 2.9% |
| 3 | 8010090 | 2.6% |
| 6 | 3236400 | 1.0% |
| 8 | 2589120 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 728190000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 126219600 | |
| 0 | 48546000 | 6.7% |
| 2 | 37137690 | 5.1% |
| e | 35924040 | 4.9% |
| 1 | 33254010 | 4.6% |
| i | 27833040 | 3.8% |
| n | 27833040 | 3.8% |
| t | 26214840 | 3.6% |
| R | 26214840 | 3.6% |
| o | 25243920 | 3.5% |
| Other values (41) | 313768980 |
frame
Real number (ℝ)
| Distinct | 26970 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13484.5 |
| Minimum | 0 |
|---|---|
| Maximum | 26969 |
| Zeros | 468 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1348 |
| Q1 | 6742 |
| median | 13484.5 |
| Q3 | 20227 |
| 95-th percentile | 25621 |
| Maximum | 26969 |
| Range | 26969 |
| Interquartile range (IQR) | 13485 |
Descriptive statistics
| Standard deviation | 7785.5687 |
|---|---|
| Coefficient of variation (CV) | 0.5773717 |
| Kurtosis | -1.2 |
| Mean | 13484.5 |
| Median Absolute Deviation (MAD) | 6742.5 |
| Skewness | 0 |
| Sum | 1.7020082 × 1011 |
| Variance | 60615080 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 468 | < 0.1% |
| 17963 | 468 | < 0.1% |
| 17987 | 468 | < 0.1% |
| 17986 | 468 | < 0.1% |
| 17985 | 468 | < 0.1% |
| 17984 | 468 | < 0.1% |
| 17983 | 468 | < 0.1% |
| 17982 | 468 | < 0.1% |
| 17981 | 468 | < 0.1% |
| 17980 | 468 | < 0.1% |
| Other values (26960) | 12617280 |
| Value | Count | Frequency (%) |
| 0 | 468 | |
| 1 | 468 | |
| 2 | 468 | |
| 3 | 468 | |
| 4 | 468 | |
| 5 | 468 | |
| 6 | 468 | |
| 7 | 468 | |
| 8 | 468 | |
| 9 | 468 |
| Value | Count | Frequency (%) |
| 26969 | 468 | |
| 26968 | 468 | |
| 26967 | 468 | |
| 26966 | 468 | |
| 26965 | 468 | |
| 26964 | 468 | |
| 26963 | 468 | |
| 26962 | 468 | |
| 26961 | 468 | |
| 26960 | 468 |
motif_hmm_650
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.139519 |
| Minimum | 0 |
|---|---|
| Maximum | 39 |
| Zeros | 415439 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 18 |
| Q3 | 27 |
| 95-th percentile | 36 |
| Maximum | 39 |
| Range | 39 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 11.058083 |
|---|---|
| Coefficient of variation (CV) | 0.60961283 |
| Kurtosis | -1.0404659 |
| Mean | 18.139519 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.091913321 |
| Sum | 2.2895628 × 108 |
| Variance | 122.28121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 549827 | 4.4% |
| 22 | 505043 | 4.0% |
| 23 | 500697 | 4.0% |
| 2 | 494792 | 3.9% |
| 15 | 486322 | 3.9% |
| 28 | 481264 | 3.8% |
| 25 | 470826 | 3.7% |
| 8 | 463476 | 3.7% |
| 1 | 446186 | 3.5% |
| 0 | 415439 | 3.3% |
| Other values (30) | 7808088 |
| Value | Count | Frequency (%) |
| 0 | 415439 | |
| 1 | 446186 | |
| 2 | 494792 | |
| 3 | 185081 | 1.5% |
| 4 | 241433 | |
| 5 | 275160 | |
| 6 | 228702 | |
| 7 | 296887 | |
| 8 | 463476 | |
| 9 | 219514 |
| Value | Count | Frequency (%) |
| 39 | 131031 | 1.0% |
| 38 | 264651 | |
| 37 | 198652 | |
| 36 | 371074 | |
| 35 | 262182 | |
| 34 | 267704 | |
| 33 | 229111 | |
| 32 | 305242 | |
| 31 | 233 | < 0.1% |
| 30 | 246862 |
centroid_x
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8477894 |
|---|---|
| Distinct (%) | 67.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 299.58075 |
| Minimum | 21.309204 |
|---|---|
| Maximum | 630.2107 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 21.309204 |
|---|---|
| 5-th percentile | 58.31167 |
| Q1 | 80.336666 |
| median | 334.48778 |
| Q3 | 483.84117 |
| 95-th percentile | 592.8322 |
| Maximum | 630.2107 |
| Range | 608.9015 |
| Interquartile range (IQR) | 403.5045 |
Descriptive statistics
| Standard deviation | 201.53799 |
|---|---|
| Coefficient of variation (CV) | 0.67273344 |
| Kurtosis | -1.5410139 |
| Mean | 299.58075 |
| Median Absolute Deviation (MAD) | 230.52156 |
| Skewness | 0.1169313 |
| Sum | 3.7812962 × 109 |
| Variance | 40617.56 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 592.5728 | 16 | < 0.1% |
| 582.30963 | 15 | < 0.1% |
| 575.4343 | 15 | < 0.1% |
| 581.4392 | 15 | < 0.1% |
| 593.27814 | 14 | < 0.1% |
| 587.7342 | 14 | < 0.1% |
| 580.50073 | 14 | < 0.1% |
| 594.2528 | 14 | < 0.1% |
| 589.75006 | 14 | < 0.1% |
| 593.4699 | 14 | < 0.1% |
| Other values (8477884) | 12621815 |
| Value | Count | Frequency (%) |
| 21.309204 | 1 | |
| 21.342188 | 1 | |
| 21.629887 | 1 | |
| 21.700506 | 1 | |
| 22.305332 | 1 | |
| 22.331964 | 1 | |
| 22.955336 | 1 | |
| 23.167027 | 1 | |
| 23.35288 | 1 | |
| 23.456802 | 1 |
| Value | Count | Frequency (%) |
| 630.2107 | 1 | |
| 630.0491 | 1 | |
| 628.40826 | 1 | |
| 627.9597 | 1 | |
| 626.09924 | 1 | |
| 625.4818 | 1 | |
| 625.33374 | 1 | |
| 625.3282 | 1 | |
| 625.0347 | 1 | |
| 625.0173 | 1 |
centroid_y
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8011955 |
|---|---|
| Distinct (%) | 63.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 345.26661 |
| Minimum | 33.870197 |
|---|---|
| Maximum | 625.66895 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 33.870197 |
|---|---|
| 5-th percentile | 68.470696 |
| Q1 | 222.9681 |
| median | 328.47266 |
| Q3 | 531.19366 |
| 95-th percentile | 589.77344 |
| Maximum | 625.66895 |
| Range | 591.79875 |
| Interquartile range (IQR) | 308.22556 |
Descriptive statistics
| Standard deviation | 174.50349 |
|---|---|
| Coefficient of variation (CV) | 0.50541665 |
| Kurtosis | -1.2538612 |
| Mean | 345.26661 |
| Median Absolute Deviation (MAD) | 151.54609 |
| Skewness | -0.028541104 |
| Sum | 4.3579414 × 109 |
| Variance | 30451.47 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 577.58014 | 17 | < 0.1% |
| 577.15295 | 17 | < 0.1% |
| 578.5644 | 16 | < 0.1% |
| 576.8431 | 16 | < 0.1% |
| 579.9652 | 15 | < 0.1% |
| 570.84534 | 15 | < 0.1% |
| 586.84174 | 15 | < 0.1% |
| 586.8863 | 15 | < 0.1% |
| 579.7055 | 15 | < 0.1% |
| 586.18744 | 15 | < 0.1% |
| Other values (8011945) | 12621804 |
| Value | Count | Frequency (%) |
| 33.870197 | 1 | |
| 33.871677 | 1 | |
| 33.955666 | 1 | |
| 33.96398 | 1 | |
| 34.07011 | 1 | |
| 34.398083 | 1 | |
| 34.50547 | 1 | |
| 34.568867 | 1 | |
| 34.629757 | 1 | |
| 34.79434 | 1 |
| Value | Count | Frequency (%) |
| 625.66895 | 1 | |
| 625.0903 | 1 | |
| 624.2235 | 1 | |
| 622.7253 | 1 | |
| 620.85626 | 1 | |
| 620.7995 | 1 | |
| 620.5762 | 1 | |
| 620.1259 | 1 | |
| 620.05914 | 1 | |
| 620.01697 | 1 |
in_center
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
| 0.0 | |
|---|---|
| 1.0 | 779187 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 37865880 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 11842773 | |
| 1.0 | 779187 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 11842773 | |
| 1.0 | 779187 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 24464733 | |
| . | 12621960 | |
| 1 | 779187 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25243920 | |
| Other Punctuation | 12621960 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24464733 | |
| 1 | 779187 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 12621960 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37865880 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 24464733 | |
| . | 12621960 | |
| 1 | 779187 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37865880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 24464733 | |
| . | 12621960 | |
| 1 | 779187 | 2.1% |
distance
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 11346129 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1773651 |
| Minimum | 7.142701 × 10-6 |
|---|---|
| Maximum | 11.729444 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 7.142701 × 10-6 |
|---|---|
| 5-th percentile | 0.0073130835 |
| Q1 | 0.028875574 |
| median | 0.074520355 |
| Q3 | 0.18493218 |
| 95-th percentile | 0.78468028 |
| Maximum | 11.729444 |
| Range | 11.729437 |
| Interquartile range (IQR) | 0.1560566 |
Descriptive statistics
| Standard deviation | 0.28175685 |
|---|---|
| Coefficient of variation (CV) | 1.5885699 |
| Kurtosis | 14.088951 |
| Mean | 0.1773651 |
| Median Absolute Deviation (MAD) | 0.055968503 |
| Skewness | 3.2648956 |
| Sum | 2238695.2 |
| Variance | 0.079386924 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.008894745 | 182 | < 0.1% |
| 0.04950115 | 132 | < 0.1% |
| 0.013804477 | 82 | < 0.1% |
| 0.05827828 | 66 | < 0.1% |
| 0.008747817 | 53 | < 0.1% |
| 0.13631444 | 50 | < 0.1% |
| 0.06681175 | 49 | < 0.1% |
| 0.03953116 | 48 | < 0.1% |
| 0.13740107 | 40 | < 0.1% |
| 0.084319 | 36 | < 0.1% |
| Other values (11346119) | 12621222 |
| Value | Count | Frequency (%) |
| 7.142701 × 10-6 | 1 | |
| 8.833231 × 10-6 | 1 | |
| 1.5805097 × 10-5 | 1 | |
| 1.907038 × 10-5 | 1 | |
| 1.9821771 × 10-5 | 1 | |
| 2.0852629 × 10-5 | 1 | |
| 2.1335338 × 10-5 | 1 | |
| 2.1438038 × 10-5 | 1 | |
| 2.3450972 × 10-5 | 1 | |
| 2.4067731 × 10-5 | 1 |
| Value | Count | Frequency (%) |
| 11.7294445 | 1 | |
| 11.706009 | 1 | |
| 11.17524 | 1 | |
| 9.56251 | 1 | |
| 9.482468 | 1 | |
| 9.046187 | 1 | |
| 8.752678 | 1 | |
| 8.302079 | 1 | |
| 8.046537 | 1 | |
| 8.038197 | 1 |
speed
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 11267460 |
|---|---|
| Distinct (%) | 89.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.3215854 |
| Minimum | 0.00796373 |
|---|---|
| Maximum | 116.38814 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 0.00796373 |
|---|---|
| 5-th percentile | 0.29368799 |
| Q1 | 0.96012717 |
| median | 2.3152779 |
| Q3 | 5.5152233 |
| 95-th percentile | 23.273718 |
| Maximum | 116.38814 |
| Range | 116.38018 |
| Interquartile range (IQR) | 4.5550961 |
Descriptive statistics
| Standard deviation | 8.2763471 |
|---|---|
| Coefficient of variation (CV) | 1.5552409 |
| Kurtosis | 12.126764 |
| Mean | 5.3215854 |
| Median Absolute Deviation (MAD) | 1.6642217 |
| Skewness | 3.1918233 |
| Sum | 67168838 |
| Variance | 68.497921 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.26684237 | 178 | < 0.1% |
| 1.4850345 | 128 | < 0.1% |
| 0.4141343 | 79 | < 0.1% |
| 1.7483484 | 63 | < 0.1% |
| 0.2624345 | 49 | < 0.1% |
| 2.0043526 | 46 | < 0.1% |
| 4.089433 | 46 | < 0.1% |
| 1.1859348 | 45 | < 0.1% |
| 4.122032 | 36 | < 0.1% |
| 2.52957 | 32 | < 0.1% |
| Other values (11267450) | 12621258 |
| Value | Count | Frequency (%) |
| 0.00796373 | 1 | |
| 0.008488682 | 1 | |
| 0.008681333 | 1 | |
| 0.009028143 | 1 | |
| 0.010097986 | 1 | |
| 0.010422492 | 1 | |
| 0.01080666 | 1 | |
| 0.010972135 | 1 | |
| 0.011098506 | 1 | |
| 0.011235485 | 1 |
| Value | Count | Frequency (%) |
| 116.38814 | 1 | |
| 114.620026 | 1 | |
| 112.355034 | 1 | |
| 110.52341 | 1 | |
| 108.23665 | 1 | |
| 105.33802 | 1 | |
| 101.9892 | 1 | |
| 101.62444 | 1 | |
| 100.751114 | 1 | |
| 100.62463 | 1 |
rat
Categorical
HIGH CORRELATION  UNIFORM 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
| Rat1 | |
|---|---|
| Rat2 | |
| Rat3 | |
| Rat4 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 50487840 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rat1 |
|---|---|
| 2nd row | Rat1 |
| 3rd row | Rat1 |
| 4th row | Rat1 |
| 5th row | Rat1 |
Common Values
| Value | Count | Frequency (%) |
| Rat1 | 3155490 | |
| Rat2 | 3155490 | |
| Rat3 | 3155490 | |
| Rat4 | 3155490 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| rat1 | 3155490 | |
| rat2 | 3155490 | |
| rat3 | 3155490 | |
| rat4 | 3155490 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 12621960 | |
| a | 12621960 | |
| t | 12621960 | |
| 1 | 3155490 | 6.2% |
| 2 | 3155490 | 6.2% |
| 3 | 3155490 | 6.2% |
| 4 | 3155490 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25243920 | |
| Uppercase Letter | 12621960 | |
| Decimal Number | 12621960 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3155490 | |
| 2 | 3155490 | |
| 3 | 3155490 | |
| 4 | 3155490 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12621960 | |
| t | 12621960 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 12621960 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37865880 | |
| Common | 12621960 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3155490 | |
| 2 | 3155490 | |
| 3 | 3155490 | |
| 4 | 3155490 |
Latin
| Value | Count | Frequency (%) |
| R | 12621960 | |
| a | 12621960 | |
| t | 12621960 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50487840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 12621960 | |
| a | 12621960 | |
| t | 12621960 | |
| 1 | 3155490 | 6.2% |
| 2 | 3155490 | 6.2% |
| 3 | 3155490 | 6.2% |
| 4 | 3155490 | 6.2% |
rat_id
Categorical
HIGH CORRELATION 
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
| A1 | 269700 |
|---|---|
| Y1 | 269700 |
| M1 | 269700 |
| K1 | 269700 |
| F2 | 269700 |
| Other values (43) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 25243920 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A1 |
|---|---|
| 2nd row | A1 |
| 3rd row | A1 |
| 4th row | A1 |
| 5th row | A1 |
Common Values
| Value | Count | Frequency (%) |
| A1 | 269700 | 2.1% |
| Y1 | 269700 | 2.1% |
| M1 | 269700 | 2.1% |
| K1 | 269700 | 2.1% |
| F2 | 269700 | 2.1% |
| P2 | 269700 | 2.1% |
| D2 | 269700 | 2.1% |
| E1 | 269700 | 2.1% |
| E2 | 269700 | 2.1% |
| F1 | 269700 | 2.1% |
| Other values (38) | 9924960 |
Length
| Value | Count | Frequency (%) |
| a1 | 269700 | 2.1% |
| x2 | 269700 | 2.1% |
| l2 | 269700 | 2.1% |
| a2 | 269700 | 2.1% |
| x1 | 269700 | 2.1% |
| y1 | 269700 | 2.1% |
| n2 | 269700 | 2.1% |
| o1 | 269700 | 2.1% |
| o2 | 269700 | 2.1% |
| m2 | 269700 | 2.1% |
| Other values (38) | 9924960 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6310980 | |
| 1 | 6310980 | |
| A | 539400 | 2.1% |
| I | 539400 | 2.1% |
| W | 539400 | 2.1% |
| T | 539400 | 2.1% |
| S | 539400 | 2.1% |
| O | 539400 | 2.1% |
| N | 539400 | 2.1% |
| X | 539400 | 2.1% |
| Other values (16) | 8306760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12621960 | |
| Uppercase Letter | 12621960 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 539400 | 4.3% |
| I | 539400 | 4.3% |
| W | 539400 | 4.3% |
| T | 539400 | 4.3% |
| S | 539400 | 4.3% |
| O | 539400 | 4.3% |
| N | 539400 | 4.3% |
| X | 539400 | 4.3% |
| H | 539400 | 4.3% |
| L | 539400 | 4.3% |
| Other values (14) | 7227960 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6310980 | |
| 1 | 6310980 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12621960 | |
| Latin | 12621960 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 539400 | 4.3% |
| I | 539400 | 4.3% |
| W | 539400 | 4.3% |
| T | 539400 | 4.3% |
| S | 539400 | 4.3% |
| O | 539400 | 4.3% |
| N | 539400 | 4.3% |
| X | 539400 | 4.3% |
| H | 539400 | 4.3% |
| L | 539400 | 4.3% |
| Other values (14) | 7227960 |
Common
| Value | Count | Frequency (%) |
| 2 | 6310980 | |
| 1 | 6310980 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25243920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6310980 | |
| 1 | 6310980 | |
| A | 539400 | 2.1% |
| I | 539400 | 2.1% |
| W | 539400 | 2.1% |
| T | 539400 | 2.1% |
| S | 539400 | 2.1% |
| O | 539400 | 2.1% |
| N | 539400 | 2.1% |
| X | 539400 | 2.1% |
| Other values (16) | 8306760 |
group
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
| Sham | |
|---|---|
| Injured | |
| Treated | |
| ABX |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 5.3076923 |
| Min length | 3 |
Characters and Unicode
| Total characters | 66993480 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sham |
|---|---|
| 2nd row | Sham |
| 3rd row | Sham |
| 4th row | Sham |
| 5th row | Sham |
Common Values
| Value | Count | Frequency (%) |
| Sham | 3236400 | |
| Injured | 3236400 | |
| Treated | 3236400 | |
| ABX | 2912760 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sham | 3236400 | |
| injured | 3236400 | |
| treated | 3236400 | |
| abx | 2912760 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9709200 | |
| a | 6472800 | 9.7% |
| r | 6472800 | 9.7% |
| d | 6472800 | 9.7% |
| S | 3236400 | 4.8% |
| h | 3236400 | 4.8% |
| m | 3236400 | 4.8% |
| I | 3236400 | 4.8% |
| n | 3236400 | 4.8% |
| j | 3236400 | 4.8% |
| Other values (6) | 18447480 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48546000 | |
| Uppercase Letter | 18447480 | 27.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9709200 | |
| a | 6472800 | |
| r | 6472800 | |
| d | 6472800 | |
| h | 3236400 | 6.7% |
| m | 3236400 | 6.7% |
| n | 3236400 | 6.7% |
| j | 3236400 | 6.7% |
| u | 3236400 | 6.7% |
| t | 3236400 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3236400 | |
| I | 3236400 | |
| T | 3236400 | |
| A | 2912760 | |
| B | 2912760 | |
| X | 2912760 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66993480 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9709200 | |
| a | 6472800 | 9.7% |
| r | 6472800 | 9.7% |
| d | 6472800 | 9.7% |
| S | 3236400 | 4.8% |
| h | 3236400 | 4.8% |
| m | 3236400 | 4.8% |
| I | 3236400 | 4.8% |
| n | 3236400 | 4.8% |
| j | 3236400 | 4.8% |
| Other values (6) | 18447480 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66993480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9709200 | |
| a | 6472800 | 9.7% |
| r | 6472800 | 9.7% |
| d | 6472800 | 9.7% |
| S | 3236400 | 4.8% |
| h | 3236400 | 4.8% |
| m | 3236400 | 4.8% |
| I | 3236400 | 4.8% |
| n | 3236400 | 4.8% |
| j | 3236400 | 4.8% |
| Other values (6) | 18447480 |
time_point
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.6 MiB |
| Baseline_1 | |
|---|---|
| Baseline_2 | |
| Week_02 | |
| Week_04 | |
| Week_06 | |
| Other values (5) |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.6923077 |
| Min length | 7 |
Characters and Unicode
| Total characters | 97092000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Baseline_1 |
|---|---|
| 2nd row | Baseline_1 |
| 3rd row | Baseline_1 |
| 4th row | Baseline_1 |
| 5th row | Baseline_1 |
Common Values
| Value | Count | Frequency (%) |
| Baseline_1 | 1294560 | |
| Baseline_2 | 1294560 | |
| Week_02 | 1294560 | |
| Week_04 | 1294560 | |
| Week_06 | 1294560 | |
| Week_08 | 1294560 | |
| Week_11 | 1294560 | |
| Week_13 | 1294560 | |
| Week_15 | 1294560 | |
| Drug_Trt | 970920 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| baseline_1 | 1294560 | |
| baseline_2 | 1294560 | |
| week_02 | 1294560 | |
| week_04 | 1294560 | |
| week_06 | 1294560 | |
| week_08 | 1294560 | |
| week_11 | 1294560 | |
| week_13 | 1294560 | |
| week_15 | 1294560 | |
| drug_trt | 970920 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 23302080 | |
| _ | 12621960 | |
| k | 9061920 | 9.3% |
| W | 9061920 | 9.3% |
| 1 | 6472800 | 6.7% |
| 0 | 5178240 | 5.3% |
| a | 2589120 | 2.7% |
| 2 | 2589120 | 2.7% |
| B | 2589120 | 2.7% |
| n | 2589120 | 2.7% |
| Other values (14) | 21036600 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50164200 | |
| Decimal Number | 20712960 | |
| Uppercase Letter | 13592880 | 14.0% |
| Connector Punctuation | 12621960 | 13.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 23302080 | |
| k | 9061920 | 18.1% |
| a | 2589120 | 5.2% |
| n | 2589120 | 5.2% |
| i | 2589120 | 5.2% |
| l | 2589120 | 5.2% |
| s | 2589120 | 5.2% |
| r | 1941840 | 3.9% |
| u | 970920 | 1.9% |
| g | 970920 | 1.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6472800 | |
| 0 | 5178240 | |
| 2 | 2589120 | 12.5% |
| 4 | 1294560 | 6.2% |
| 6 | 1294560 | 6.2% |
| 8 | 1294560 | 6.2% |
| 3 | 1294560 | 6.2% |
| 5 | 1294560 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 9061920 | |
| B | 2589120 | 19.0% |
| D | 970920 | 7.1% |
| T | 970920 | 7.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12621960 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63757080 | |
| Common | 33334920 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 23302080 | |
| k | 9061920 | 14.2% |
| W | 9061920 | 14.2% |
| a | 2589120 | 4.1% |
| B | 2589120 | 4.1% |
| n | 2589120 | 4.1% |
| i | 2589120 | 4.1% |
| l | 2589120 | 4.1% |
| s | 2589120 | 4.1% |
| r | 1941840 | 3.0% |
| Other values (5) | 4854600 | 7.6% |
Common
| Value | Count | Frequency (%) |
| _ | 12621960 | |
| 1 | 6472800 | |
| 0 | 5178240 | |
| 2 | 2589120 | 7.8% |
| 4 | 1294560 | 3.9% |
| 6 | 1294560 | 3.9% |
| 8 | 1294560 | 3.9% |
| 3 | 1294560 | 3.9% |
| 5 | 1294560 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97092000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 23302080 | |
| _ | 12621960 | |
| k | 9061920 | 9.3% |
| W | 9061920 | 9.3% |
| 1 | 6472800 | 6.7% |
| 0 | 5178240 | 5.3% |
| a | 2589120 | 2.7% |
| 2 | 2589120 | 2.7% |
| B | 2589120 | 2.7% |
| n | 2589120 | 2.7% |
| Other values (14) | 21036600 |
Motif
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.139519 |
| Minimum | 0 |
|---|---|
| Maximum | 39 |
| Zeros | 415439 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 18 |
| Q3 | 27 |
| 95-th percentile | 36 |
| Maximum | 39 |
| Range | 39 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 11.058083 |
|---|---|
| Coefficient of variation (CV) | 0.60961283 |
| Kurtosis | -1.0404659 |
| Mean | 18.139519 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.091913321 |
| Sum | 2.2895628 × 108 |
| Variance | 122.28121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 549827 | 4.4% |
| 22 | 505043 | 4.0% |
| 23 | 500697 | 4.0% |
| 2 | 494792 | 3.9% |
| 15 | 486322 | 3.9% |
| 28 | 481264 | 3.8% |
| 25 | 470826 | 3.7% |
| 8 | 463476 | 3.7% |
| 1 | 446186 | 3.5% |
| 0 | 415439 | 3.3% |
| Other values (30) | 7808088 |
| Value | Count | Frequency (%) |
| 0 | 415439 | |
| 1 | 446186 | |
| 2 | 494792 | |
| 3 | 185081 | 1.5% |
| 4 | 241433 | |
| 5 | 275160 | |
| 6 | 228702 | |
| 7 | 296887 | |
| 8 | 463476 | |
| 9 | 219514 |
| Value | Count | Frequency (%) |
| 39 | 131031 | 1.0% |
| 38 | 264651 | |
| 37 | 198652 | |
| 36 | 371074 | |
| 35 | 262182 | |
| 34 | 267704 | |
| 33 | 229111 | |
| 32 | 305242 | |
| 31 | 233 | < 0.1% |
| 30 | 246862 |
Exclude
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 108.3 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 11360481 | |
| True | 1261479 | 10.0% |
Moving_Quickly
Boolean
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5166976 |
| Missing (%) | 40.9% |
| Memory size | 192.6 MiB |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 4230312 | |
| True | 3224672 | |
| (Missing) | 5166976 |
Predominant_Behavior
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 909134 |
| Missing (%) | 7.2% |
| Memory size | 192.6 MiB |
| Sniffing | |
|---|---|
| Rearing | |
| Stationary | |
| Grooming | |
| Walking |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 7.9898179 |
| Min length | 7 |
Characters and Unicode
| Total characters | 93583347 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rearing |
|---|---|
| 2nd row | Rearing |
| 3rd row | Rearing |
| 4th row | Rearing |
| 5th row | Rearing |
Common Values
| Value | Count | Frequency (%) |
| Sniffing | 5184591 | |
| Rearing | 3060784 | |
| Stationary | 1464819 | 11.6% |
| Grooming | 1168789 | 9.3% |
| Walking | 728127 | 5.8% |
| Mixed behaviors | 105716 | 0.8% |
| (Missing) | 909134 | 7.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sniffing | 5184591 | |
| rearing | 3060784 | |
| stationary | 1464819 | 12.4% |
| grooming | 1168789 | 9.9% |
| walking | 728127 | 6.2% |
| mixed | 105716 | 0.9% |
| behaviors | 105716 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 17003133 | |
| n | 16791701 | |
| f | 10369182 | |
| g | 10142291 | |
| a | 6824265 | |
| S | 6649410 | 7.1% |
| r | 5800108 | 6.2% |
| o | 3908113 | 4.2% |
| e | 3272216 | 3.5% |
| R | 3060784 | 3.3% |
| Other values (15) | 9762144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 81764805 | |
| Uppercase Letter | 11712826 | 12.5% |
| Space Separator | 105716 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 17003133 | |
| n | 16791701 | |
| f | 10369182 | |
| g | 10142291 | |
| a | 6824265 | |
| r | 5800108 | 7.1% |
| o | 3908113 | 4.8% |
| e | 3272216 | 4.0% |
| t | 2929638 | 3.6% |
| y | 1464819 | 1.8% |
| Other values (9) | 3259339 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6649410 | |
| R | 3060784 | |
| G | 1168789 | 10.0% |
| W | 728127 | 6.2% |
| M | 105716 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 105716 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 93477631 | |
| Common | 105716 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 17003133 | |
| n | 16791701 | |
| f | 10369182 | |
| g | 10142291 | |
| a | 6824265 | |
| S | 6649410 | 7.1% |
| r | 5800108 | 6.2% |
| o | 3908113 | 4.2% |
| e | 3272216 | 3.5% |
| R | 3060784 | 3.3% |
| Other values (14) | 9656428 |
Common
| Value | Count | Frequency (%) |
| 105716 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93583347 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 17003133 | |
| n | 16791701 | |
| f | 10369182 | |
| g | 10142291 | |
| a | 6824265 | |
| S | 6649410 | 7.1% |
| r | 5800108 | 6.2% |
| o | 3908113 | 4.2% |
| e | 3272216 | 3.5% |
| R | 3060784 | 3.3% |
| Other values (15) | 9762144 |
Secondary_Descriptor
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8290492 |
| Missing (%) | 65.7% |
| Memory size | 192.6 MiB |
| Active | |
|---|---|
| Stationary | |
| Quick | 264651 |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.6933319 |
| Min length | 5 |
Characters and Unicode
| Total characters | 28991953 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Active |
|---|---|
| 2nd row | Active |
| 3rd row | Active |
| 4th row | Active |
| 5th row | Active |
Common Values
| Value | Count | Frequency (%) |
| Active | 3249868 | 25.7% |
| Stationary | 816949 | 6.5% |
| Quick | 264651 | 2.1% |
| (Missing) | 8290492 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| active | 3249868 | |
| stationary | 816949 | 18.9% |
| quick | 264651 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4883766 | |
| i | 4331468 | |
| c | 3514519 | |
| A | 3249868 | |
| v | 3249868 | |
| e | 3249868 | |
| a | 1633898 | 5.6% |
| S | 816949 | 2.8% |
| o | 816949 | 2.8% |
| n | 816949 | 2.8% |
| Other values (5) | 2427851 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24660485 | |
| Uppercase Letter | 4331468 | 14.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4883766 | |
| i | 4331468 | |
| c | 3514519 | |
| v | 3249868 | |
| e | 3249868 | |
| a | 1633898 | 6.6% |
| o | 816949 | 3.3% |
| n | 816949 | 3.3% |
| r | 816949 | 3.3% |
| y | 816949 | 3.3% |
| Other values (2) | 529302 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3249868 | |
| S | 816949 | 18.9% |
| Q | 264651 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28991953 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4883766 | |
| i | 4331468 | |
| c | 3514519 | |
| A | 3249868 | |
| v | 3249868 | |
| e | 3249868 | |
| a | 1633898 | 5.6% |
| S | 816949 | 2.8% |
| o | 816949 | 2.8% |
| n | 816949 | 2.8% |
| Other values (5) | 2427851 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28991953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4883766 | |
| i | 4331468 | |
| c | 3514519 | |
| A | 3249868 | |
| v | 3249868 | |
| e | 3249868 | |
| a | 1633898 | 5.6% |
| S | 816949 | 2.8% |
| o | 816949 | 2.8% |
| n | 816949 | 2.8% |
| Other values (5) | 2427851 |
Category
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 909134 |
| Missing (%) | 7.2% |
| Memory size | 192.6 MiB |
| Exploratory | |
|---|---|
| Locomotor | |
| Resting | |
| Mixed | 105716 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.166887 |
| Min length | 5 |
Characters and Unicode
| Total characters | 119082984 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Exploratory |
|---|---|
| 2nd row | Exploratory |
| 3rd row | Exploratory |
| 4th row | Exploratory |
| 5th row | Exploratory |
Common Values
| Value | Count | Frequency (%) |
| Exploratory | 8510026 | |
| Locomotor | 1632265 | 12.9% |
| Resting | 1464819 | 11.6% |
| Mixed | 105716 | 0.8% |
| (Missing) | 909134 | 7.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| exploratory | 8510026 | |
| locomotor | 1632265 | 13.9% |
| resting | 1464819 | 12.5% |
| mixed | 105716 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 23549112 | |
| r | 18652317 | |
| t | 11607110 | |
| x | 8615742 | 7.2% |
| E | 8510026 | 7.1% |
| p | 8510026 | 7.1% |
| l | 8510026 | 7.1% |
| a | 8510026 | 7.1% |
| y | 8510026 | 7.1% |
| m | 1632265 | 1.4% |
| Other values (10) | 12476308 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 107370158 | |
| Uppercase Letter | 11712826 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 23549112 | |
| r | 18652317 | |
| t | 11607110 | |
| x | 8615742 | 8.0% |
| p | 8510026 | 7.9% |
| l | 8510026 | 7.9% |
| a | 8510026 | 7.9% |
| y | 8510026 | 7.9% |
| m | 1632265 | 1.5% |
| c | 1632265 | 1.5% |
| Other values (6) | 7641243 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 8510026 | |
| L | 1632265 | 13.9% |
| R | 1464819 | 12.5% |
| M | 105716 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119082984 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 23549112 | |
| r | 18652317 | |
| t | 11607110 | |
| x | 8615742 | 7.2% |
| E | 8510026 | 7.1% |
| p | 8510026 | 7.1% |
| l | 8510026 | 7.1% |
| a | 8510026 | 7.1% |
| y | 8510026 | 7.1% |
| m | 1632265 | 1.4% |
| Other values (10) | 12476308 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 119082984 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 23549112 | |
| r | 18652317 | |
| t | 11607110 | |
| x | 8615742 | 7.2% |
| E | 8510026 | 7.1% |
| p | 8510026 | 7.1% |
| l | 8510026 | 7.1% |
| a | 8510026 | 7.1% |
| y | 8510026 | 7.1% |
| m | 1632265 | 1.4% |
| Other values (10) | 12476308 |
| Category | Exclude | Motif | Moving_Quickly | Predominant_Behavior | Secondary_Descriptor | centroid_x | centroid_y | distance | frame | group | in_center | motif_hmm_650 | rat | rat_id | speed | time_point | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Category | 1.000 | 0.622 | 0.169 | 0.358 | 0.979 | 1.000 | 0.018 | 0.029 | -0.349 | 0.181 | 0.031 | 0.040 | 0.169 | 0.048 | 0.110 | -0.375 | 0.055 |
| Exclude | 0.622 | 1.000 | 0.101 | 1.000 | 0.656 | 1.000 | -0.007 | 0.036 | -0.122 | -0.014 | 0.019 | 0.038 | 0.101 | 0.057 | 0.085 | -0.134 | 0.060 |
| Motif | 0.169 | 0.101 | 1.000 | 0.635 | 0.505 | 0.827 | 0.014 | 0.017 | -0.144 | 0.061 | 0.035 | 0.077 | 1.000 | 0.100 | 0.088 | -0.155 | 0.038 |
| Moving_Quickly | 0.358 | 1.000 | 0.635 | 1.000 | 0.624 | 0.564 | 0.004 | -0.088 | 0.424 | -0.232 | 0.026 | 0.058 | 0.055 | 0.074 | 0.126 | 0.464 | 0.084 |
| Predominant_Behavior | 0.979 | 0.656 | 0.505 | 0.624 | 1.000 | 1.000 | 0.090 | -0.070 | 0.018 | 0.008 | 0.047 | 0.093 | -0.067 | 0.104 | 0.126 | 0.020 | 0.060 |
| Secondary_Descriptor | 1.000 | 1.000 | 0.827 | 0.564 | 1.000 | 1.000 | 0.004 | -0.015 | -0.391 | 0.167 | 0.035 | 0.046 | 0.357 | 0.077 | 0.157 | -0.426 | 0.135 |
| centroid_x | 0.018 | -0.007 | 0.014 | 0.004 | 0.090 | 0.004 | 1.000 | -0.001 | 0.077 | -0.034 | 0.047 | 0.362 | 0.014 | 0.581 | 0.356 | 0.080 | 0.038 |
| centroid_y | 0.029 | 0.036 | 0.017 | -0.088 | -0.070 | -0.015 | -0.001 | 1.000 | -0.043 | 0.005 | 0.134 | 0.293 | 0.017 | 0.582 | 0.326 | -0.045 | 0.030 |
| distance | -0.349 | -0.122 | -0.144 | 0.424 | 0.018 | -0.391 | 0.077 | -0.043 | 1.000 | -0.248 | 0.006 | 0.093 | -0.144 | 0.013 | 0.021 | 0.935 | 0.010 |
| frame | 0.181 | -0.014 | 0.061 | -0.232 | 0.008 | 0.167 | -0.034 | 0.005 | -0.248 | 1.000 | 0.000 | 0.030 | 0.061 | 0.000 | 0.000 | -0.261 | 0.000 |
| group | 0.031 | 0.019 | 0.035 | 0.026 | 0.047 | 0.035 | 0.047 | 0.134 | 0.006 | 0.000 | 1.000 | 0.048 | 0.010 | 0.128 | 1.000 | -0.011 | 0.091 |
| in_center | 0.040 | 0.038 | 0.077 | 0.058 | 0.093 | 0.046 | 0.362 | 0.293 | 0.093 | 0.030 | 0.048 | 1.000 | -0.031 | 0.118 | 0.215 | 0.065 | 0.053 |
| motif_hmm_650 | 0.169 | 0.101 | 1.000 | 0.055 | -0.067 | 0.357 | 0.014 | 0.017 | -0.144 | 0.061 | 0.010 | -0.031 | 1.000 | 0.100 | 0.088 | -0.155 | 0.038 |
| rat | 0.048 | 0.057 | 0.100 | 0.074 | 0.104 | 0.077 | 0.581 | 0.582 | 0.013 | 0.000 | 0.128 | 0.118 | 0.100 | 1.000 | 0.942 | -0.008 | 0.000 |
| rat_id | 0.110 | 0.085 | 0.088 | 0.126 | 0.126 | 0.157 | 0.356 | 0.326 | 0.021 | 0.000 | 1.000 | 0.215 | 0.088 | 0.942 | 1.000 | 0.017 | 0.053 |
| speed | -0.375 | -0.134 | -0.155 | 0.464 | 0.020 | -0.426 | 0.080 | -0.045 | 0.935 | -0.261 | -0.011 | 0.065 | -0.155 | -0.008 | 0.017 | 1.000 | 0.018 |
| time_point | 0.055 | 0.060 | 0.038 | 0.084 | 0.060 | 0.135 | 0.038 | 0.030 | 0.010 | 0.000 | 0.091 | 0.053 | 0.038 | 0.000 | 0.053 | 0.018 | 1.000 |
| file_name | frame | motif_hmm_650 | centroid_x | centroid_y | in_center | distance | speed | rat | rat_id | group | time_point | Motif | Exclude | Moving_Quickly | Predominant_Behavior | Secondary_Descriptor | Category | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 0 | 28 | 73.74916 | 281.14725 | 0.0 | 0.237028 | 5.788066 | Rat1 | A1 | Sham | Baseline_1 | 28 | False | NaN | Rearing | NaN | Exploratory |
| 1 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 1 | 0 | 73.97390 | 279.87573 | 0.0 | 0.277611 | 6.482201 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 2 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 2 | 0 | 74.12506 | 278.42282 | 0.0 | 0.314063 | 7.303471 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 3 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 3 | 0 | 74.03892 | 276.72820 | 0.0 | 0.364817 | 8.381433 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 4 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 4 | 0 | 73.83466 | 274.99896 | 0.0 | 0.374372 | 9.407344 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 5 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 5 | 0 | 73.48978 | 273.12793 | 0.0 | 0.409046 | 10.439452 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 6 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 6 | 0 | 73.04165 | 270.88998 | 0.0 | 0.490712 | 11.718060 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 7 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 7 | 0 | 72.51922 | 268.96298 | 0.0 | 0.429260 | 12.409242 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 8 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 8 | 0 | 71.96824 | 267.16284 | 0.0 | 0.404755 | 12.648870 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| 9 | 22-01-10_Baseline_1_DJL_TABB_cropped_CRF0_0min_to_15min_Rat1 | 9 | 0 | 71.44137 | 265.38058 | 0.0 | 0.399579 | 12.800114 | Rat1 | A1 | Sham | Baseline_1 | 0 | False | False | Rearing | NaN | Exploratory |
| file_name | frame | motif_hmm_650 | centroid_x | centroid_y | in_center | distance | speed | rat | rat_id | group | time_point | Motif | Exclude | Moving_Quickly | Predominant_Behavior | Secondary_Descriptor | Category | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 12621950 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26960 | 11 | 376.16336 | 375.05493 | 0.0 | 0.072970 | 2.255560 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621951 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26961 | 11 | 376.07925 | 375.34003 | 0.0 | 0.063913 | 2.191613 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621952 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26962 | 11 | 376.03363 | 375.57294 | 0.0 | 0.051027 | 2.051072 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621953 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26963 | 11 | 376.04352 | 375.74550 | 0.0 | 0.037163 | 1.813118 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621954 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26964 | 11 | 376.11203 | 375.85340 | 0.0 | 0.027476 | 1.515297 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621955 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26965 | 11 | 376.22357 | 375.89578 | 0.0 | 0.025657 | 1.231416 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621956 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26966 | 11 | 376.33826 | 375.87506 | 0.0 | 0.025057 | 0.998283 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621957 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26967 | 11 | 376.38602 | 375.79605 | 0.0 | 0.019845 | 0.811193 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621958 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26968 | 11 | 376.26016 | 375.66530 | 0.0 | 0.039014 | 0.822301 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |
| 12621959 | 22-05-05_Drug_Trt_DJL_TXBY_cropped_CRF0_0min_to_15min_Rat4 | 26969 | 11 | 375.81070 | 375.49000 | 0.0 | 0.103729 | 1.279816 | Rat4 | Y2 | Treated | Drug_Trt | 11 | False | False | Sniffing | Active | Exploratory |